A Recent Survey on Unstructured Data to Structured Data in Distributed Data Mining

نویسنده

  • M. Hemalatha
چکیده

The organization of unstructured data is recognized as one of the major uncertain problems in the information industry and data mining paradigm. It will be in the form of computerized information that moreover, does not have a data model and there are not simply used by data mining. The task of managing unstructured data signifies possibly the major data management opportunity for our community subsequently managing relational data. The communities users such as KDD, Semantic web, AI and web, to industrial users such as Google, Yahoo, and Microsoft.This paper presents a study and analysis of the unstructured to structured distributed data mining. Several methods are available to manage unstructured data. The results of this analysis show that, there is a lack in managing unstructured data into structured data. Here, given a discussion about the impact of unstructured data in distributed data mining.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A generalized Framework of Privacy Preservation in Distributed Data mining for Unstructured Data Environment

The management of unstructured data is recognized as one of the major unsolved problems in the information industry and data mining paradigm. Unstructured data in computerized information that either does not have a data model and there are not easily usable by data mining. This paper proposes a solution to this problem by managing unstructured data in to structured data using legacy system and...

متن کامل

A Study on Text Mining over Hadoop Framework ‖

In today’s scenario as data is increasing day by day so text data mining approaches are playing a vital role in extracting many potential information and association from a large amount of text data. The term data mining is used for methods that analyze data and data mining deals with structured data, whereas text mining presents different formats that are unstructured or semi-structured data. ...

متن کامل

In-depth Interactive Visual Exploration for Bridging Unstructured and Structured Document Content

Semi-structured data refers to the combination of unstructured and structured data. Unstructured data is free text in natural language, while structured data is typically stored in tables and following a data schema. Recent statistics shows that 80% of the data generated in the last two years is unstructured. However, one interesting observation is that free text usually comes along with some s...

متن کامل

Data Mining Architectures - A Comparative Study

Data mining is the process of deriving knowledge from data. The architecture of a data mining system plays a significant role in the efficiency with which data is mined. It is probably as important as the algorithms used for the mining process. CRITIKAL is a three-tier data mining architecture consisting of Client, Middle tier and the Data Warehouse. The architecture for mining semi-structured ...

متن کامل

Comparison of Structured vs. Unstructured Data for Industrial Quality Analysis

Industrial methods for quality analysis massively rely on structured data describing product features and product usage. The analysis of such data is normally done using complex reporting or sophisticated data mining methods. Besides this structured data, companies very often also posses large amounts of unstructured text like call center reports, internet fora or repair order documents. Despit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014